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ABSTRACT 

It has been widely claimed that several lines of observational evidence point towards a "down- 
sizing" of the process of galaxy formation over cosmic time. This behaviour is sometimes 
termed "anti-hierarchical", and contrasted with the "bottom-up" (small objects form first) as- 
sembly of the dark matter structures in Cold Dark Matter models. In this paper we address 
three different kinds of observational evidence that have been described as "downsizing": the 
stellar mass assembly (i.e. more massive galaxies assemble at higher redshift with respect to 
low-mass ones), star formation rate (i.e. the decline of the specific star formation rate is faster 
for more massive systems) and the ages of the stellar populations in local galaxies (i.e. more 
massive galaxies host older stellar populations). We compare a broad compilation of available 
data-sets with the predictions of three different semi-analytic models of galaxy formation 
within the ACDM framework. In the data, we see only weak evidence at best of "downsizing" 
in stellar mass and in star formation rate. Despite the different implementations of the phys- 
ical recipes, the three models agree remarkably well in their predictions. We find that, when 
observational errors on stellar mass and SFR are taken into account, the models acceptably 
reproduce the evolution of massive galaxies (M > 10 11 M Q in stellar mass), over the en- 
tire redshift range that we consider (0 ^ z ^ 4). However, lower mass galaxies, in the stellar 
mass range 10 9 — 10 11 Mq, are formed too early in the models and are too passive at late 
times. Thus, the models do not correctly reproduce the downsizing trend in stellar mass or the 
archaeological downsizing, while they qualitatively reproduce the mass-dependent evolution 
of the SFR. We demonstrate that these discrepancies are not solely due to a poor treatment 
of satellite galaxies but are mainly connected to the excessively efficient formation of central 
galaxies in high-redshift haloes with circular velocities ~ 100 — 200 km/s. We conclude that 
some physical process operating on these mass scales — most probably star formation and/or 
supernova feedback — is not yet properly treated in these models. 
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1 INTRODUCTION 

In the last decades, the par ameters of the cosm ological model 
have been tightly constrained dKomatsu et alj|2009l . and references 
therein), and the CDM paradigm has proved to be very success- 
ful in reproducing a large number of observations, particularly on 
large scales. The current standard paradigm for structure formation 
predicts that the collapse of dark matter (DM) haloes proceeds in a 



"bottom-up" fashion, with smaller structures forming first and later 
merging into larger systems. It has long been known that galaxies 
do not share the same 'bottom-up' evolution, at least in their star 
formation histories. The most massive galaxies — mainly giant el- 
lipticals hosted in galaxy groups and clusters — are dominated by 
old stellar populations. In contrast, faint field galaxies appear to 
have continued to actively form stars over the last billion years, 
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and their stellar populations are dominated by young stars. This ev- 
idence is not necessarily in contrast with the hierarchical clustering 
of DM haloes as it relates to the "formation" of the main stellar 
population of a galaxy, which does not necessarily coincide with 
the "assembly" of its stellar mass and/or the assembly of its parent 
dark matter halo. 

In the last decade, much observational effort has been devoted 
to quantifying the dependence of galaxy format ion and assembly on 
stellar mass. In one of the earliest such studies. ICowie et al I fl996h 
showed that the maximum rest-frame A'-band luminosity of galax- 
ies undergoing rapid star formation in the Hawaii deep field de- 
clines smoothly with cosmological time. Cowie and collaborators 
coined the term "downsizing" (hereafter DS) to describe this be- 
haviour. Since then, the same term has been extended to a number 
of observational trends suggesting either older ages, earlier active 
star formation, or earlier assembly for more massive galaxies with 
respect to their lower mass counterparts. Using the same word to 
describe very different kinds of observational results has naturally 
generated some confusion. The underlying thought has clearly been 
that these observations are all manifestations of the same underly- 
ing physical process. It is not clear to which degree this is in fact 
the case, nor to what degree these observational trends are "anti- 
hierarchical", i.e. whether they are in fact in serious conflict with 
predictions from models based on ACDM cosmology. 

It is useful, at this point, to summarise the different types of 
"downsizing" that have been discussed in the literature. Clearly, 
each of the observational evidences discussed below has its own 
set of uncertainties and potential biases. Here, we report the trends 
as they have been claimed in the literature, and discuss in more de- 
tail the related uncertainties and caveats later. The first two types of 
DS that we describe are based on the local "fossil record" and are 
related to the time of "formation" of the stellar population, i.e. they 
tell us that the bulk of the stars in more massive galaxies formed 
earlier and on shorter time-scales than in their lower mass counter- 
parts. These two types of DS are: 

Chemo-archaeological DS: among elliptical galaxies, more 
massive objects have higher (up to supe r -solar ) [q/ Fe] ratios. This 
result was first reported by iFaber et al] d 19921) and IWorthev et al.l 
dl992l) . who suggested three possible (and equally acceptable at 
that time) explanations: (i) different star formation time-scales; (ii) 
a variable Initial Mass Function (IMF); (iii) selective mass loss 
mechanisms. Several studies ha v e since confirmed t his observa- 
tional t rend dCarollo et al.lll993l : iDavies et all 1 19931 ; iTrager et al.l 
l2000bl : | Kuntschner et al.l 1200 lh . and a standard interpretation 
has become that of shor ter formation time-scales in more lu - 
minous/massive galaxies dMatteuccil 1 19941 : iThomas et alj 120051) . 
though other interpretations have not been conclusively ruled out. 

Archaeological DS: more massive galaxies host older stellar 
populations than lower mass galaxies. A direct estimate of stellar 
ages is hampered by the well-known age-metallicity degeneracy 
(e.g. lTrager et"al]|2000al and references therein), although it has 
long been known that there are some spectral features (like Balmer 
lines) that are more sensitive to age than to metallicity (see i.e., 
IWorthey|[T994h . Recent detailed analyses, based on a combination 
of spectral indexes or on a detailed fit of the full high-resolution 
spectrum, have con firmed a weak trend between stellar m ass and 
age both in clusters dThomas et al . 2005; N elan et al]|2005L though 
see Trager et al. 2008), and in the field dTrager et al.l l2000al : 
iHeavens et al.ll2004l : lGaTlazzi et alfeOOllPanter et al.||2007l) . 

The second kind of observational evidence for DS comes from 
"lookback studies", or observations of galaxies at different cosmic 
epochs: 



DS in (specific) SFR: the mass of 'star-forming galax- 
ies' decli nes with de c reasing redshift. This trend was first 
seen by ICowie et alj d 19961) and there have been many 
claimed confirmations b y subsequent deeper an d /or wider ob- 
serva ti onal programmes l Brinchmann et al. 120041; iKodama et al 



2004 1; iFeulner et al] 120051: iBauer et alj |2005|; IPapovich et al 



2006 ; iBundv et alj 120061 ; IPannella et alj I2OO0I : iBell et all 12007 
aLJ , I U ' . , I L„Ij I. Ill 



Noeske et al. 2007; Drory & Alvarez 2008; Vergani et al 



2008 



Chen et all 120091 ; ICowie & Barger 2008J). This trend can also be 



recast as implying that the SFR density or specific star formation 
rate declines more rapidly for more massive systems; here there 
are conflicting claims in th e literature about whether such a trend 
is in fact seen or no t (e.g. Juneau et alj 20051 ; IZheng et alj|2007l ; 
IConselice et~al]|2007l : iMobasher et alj|2009h . This trend reflects in 
SSFRs of nearby spiral ga laxies which are higher for lower mass 
objects dBoselli et alj200ll) . A possibly related trend is the increase 
with time of faint red-sequence galaxies in galaxy clusters (see, 
e.g.. |Pe Lucia et alj|2004 l200l iGilbank et alj|2008l) . which may 
be due to a differential decline of the SSFR. 

DS in stellar mass: the high-mass end of the stellar mass 
function (hereafter MF) evolves more slowly than the low-mass 
end, indicating that massive galaxies were assembled earlier 
than less massive ones. The same result is found both by cor- 
recting the B-b and or K-band lum inosity functions for "pas- 



lar mass using multi-wavelength photometry dDrorv et al. 


2004, 


20051: Bundv et al. 


20061: Borch et alj 20061: Fontana et al. 


2006; 


Pozzetti et alj 12007 


; IConselice et alj 120071; Perez-Gonzalez et al. 


2008); Marchesini et al. 2008). The significance of these claims has 



been recently questioned bv lMarchesini et al.ld2008 

DS in metallicity: the stellar metallicity of more massive 
galaxies appears t o decrease with redshift more slowly than for less 
massive galaxies dSayaglio et alj|2005l : lErb et alj|2006l : | Ando et alj 
120071 ; iMaiolino et alj l2008h . It is important to note, however, 
that often different indicators are used at different redshifts, and 
that there are large un certainties in the metallicity calibration 
dKewlev & Ellisonl2008l) . 

DS in nuclear activity: the number density of Active Galac- 
tic Nuclei (AGNs) peaks at higher redshift when brig hter objects 
are considered. This trend is found both f or X-ray dUeda et alj 
20031: lHasinger et alj|2005l) and for optically dCristiani et alj|2004l : 



Fontanot et al .120071) selected AGNs, but it strongly d epends on the 
modeling of obscuration (e.g. jLa Franca et alj2005l) . 

Downsizing trends have often been considered "anti- 
hierarchical", suggesting expected and/or demonstrated difficul- 
ties in reconciling the observed trends with predictions from hi- 
erarchical galaxy formation models. The naive expectation is that, 
like for dark matter haloes, galaxy formation also proceeds in a 
bottom-up fashion with more massive systems "form ing" later. It 
has already been pointed out in early theoretical work dBaugh et alj 
1 1 9961 : iKauffmannl 1 1 9961) that the epoch of formation of the stars 
within a galaxy does not necessa rily coincide with the epoch of 
the galaxy's a ssembly. Moreover, iNeistein et alj d2006h (see also 
iLi et alj|2008h suggested that a certain degree of "natural down- 
sizing" is actually expected in the CDM paradigm if one assumes 
that there is a minimum halo mass that can support star forma- 
tion and considers the integrated mass in all progenitor haloes 
rather than just that in the main progenitor. However, several au- 



thors ( Cimatti et al .l2006l , lFontana et a l. 2006; F ontanot et alj2006l: 



ICirasuolo et"ail 2008) have argued that the observed mass assem- 
bly DS represents a challenge for modern hierarchical galaxy for- 
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mation models. As well, CDM models have been unablaH to re 



produce the observ e d chemo-archaeo l ogical DS i Thomas! 1 19991: 



iThomas et aD 120051: INagashima et all 120051: IPipino et al.l koolh 
and lSomerville et al.l ( 120080 and lTrager & Somervilld |2008J) have 
shown that the modern generation of models does not quantitatively 
reproduce the archaeological DS trend in the field or in rich clus- 
ters. 

Ea rly phenomeno l ogical mod els of joint galaxy-AG N forma- 
tion by iMonaco etal] fcOOd) and iGranato et all ( feOOll) produced 
"anti-hierarchical" formation of elliptical galaxies in ACDM halos 
by delaying quasar activity in less massive halos. More recently, it 
has been suggested that A GN feedback could provide a solution t o 
the "downsizing problem" dBower et~al . 2006; C roton et alj|2006h . 
The suppression of late gas condensation in massive haloes gives 
rise to shorter forma tion time-scales for more massive galaxies 

Lucia et al. I l2006h . in qualitative agreement with the obs erved 
trends. However, the recent work bv lSomerville et al.l ( 120081) indi- 
cates that the predicted trends may not be as strong as the observed 
ones, even in the presence of AGN feedback. 

Moreover, AGN feedback does not stop the growth in stellar 
mass via mergers. ACDM models predict that the stellar masses of 
the most massive galaxies have increased b y a factor two or more 
since z ~ 1 via gas-poo r "dry mergers' ' |5e Lucia et al. I l2006l : 
|Pe Lucia & Blaizotll2007l) . It has been suggested that if mergers 
scatter a significant fraction of the stars in the progenitor galaxies 
into a "diffuse stellar component", then perhaps one can reconcile 
the CDM predictions with the o bserved weak evolution in the stel- 
lar mass function since z ~ 1 dMonaco"eT al, 2006; Conrov et al .1 
120071 ; ISomerville et al.|[200^> . but observational uncertainties on 
the amount of diffuse light are still too large to strongly constrain 
models of this process. 

Despite the large number of papers related to the subject of 
"downsizing", a detailed and systematic comparison between a 
broad compilation of observational data and predictions from hi- 
erarchical galaxy formation models is still missing. Our study is 
a first attempt in this direction. We present here predictions from 
three different semi-analytic models (see f|2), all of which have 
been tuned to provide reasonably good agreement with the ob- 
served properties of galaxies in the local Universe, and compare 
them to an extensive compilation of recent data on the evolution 
of the stellar MF, SSFRs, and SFR densities, as well as with ob- 
servational determinations of stellar population ages as a function 
of mass in nearby galaxies. An important new aspect of our study 
is that we consider three (claimed) "manifestations" of downsizing 
simultaneously. Because these very different kinds of observations 
have very different potential selection effects and biases, this al- 
lows us to make a strong argument that, when a discrepancy is seen 
between the models and all three kinds of observations, this dis- 
crepancy is due to shortcomings in the physical ingredients of the 
models rather than errors or biases in the observations. Similarly, by 
making use of three independently developed semi-analytic mod- 
els, which include different implementations of the main physical 
processes, we can hope to determine which conclusions are robust 
to model details. 

In this study, we do not address the "Chemo-archaeological 



DS" or the "DS in metallicity". Our chemical enrichment models 
are all based on an instantaneous recycling approximation which 
prevents us from making detailed comparisons with observed ele- 
mental abundances. We have also decided not to discuss here the 
"DS in AGN activity", which depends strongly on the complicated 
and poorly understood physics of accretion onto black holes and on 
its re l ation to star formatio n activity (see, e.g.. iMenci et alj|2004l 
l2008l ; lFontanot et alj|2006h . 

The paper is organized as follows: in ij2]we give a brief intro- 
duction to the models we use in our study. We then present our re- 
sults for the DS in stellar mass (fJ3}, in SFR (fj4), and on the archae- 
ological DS (*j5). In ij6] we discuss our results and give our con- 
clusions. Throughout this paper we assume a cosmological model 
consistent with the WMAP3 results. 



2 MODELS 

We consider predictions from three independently developed codes 
that use semi-analytic modeling (SAM) techniques to simulate the 
formation of galaxies w ithin the ACD M cosmogony (for a review 
on these techniques see lBaughll2006l) . In SAMs, the evolution of 
the baryonic component of galaxies — which are assumed to form 
when gas condenses at the center of DM haloes — is modeled us- 
ing simple but physically motivated analytic 'recipes'. The param- 
eters entering these analytic approximations of the various phys- 
ical processes are usually fixed by comparing model predictions 
to observational data of local galaxies. Although the treatment of 
the physical processes is necessarily simplified, this technique al- 
lows modelers to explore (at least schematically) a broad range of 
processes that could not be directly simulated simultaneously (e.g., 
accretion onto SMBH on sub-pc scales within the framework of 
cosmological structure formation), and to explore a wide parame- 
ter space. 

Most of the various SAMs proposed in the literature are at- 
tempting to model the same basic set of physical processes. When 
a comparison is made of several SAMs with observations, one may 
focus on differences between models, with the aim of understand- 
ing how the details of a particular implementation influence the 
predictions of galaxy properties. Alternatively, one may concen- 
trate on comparing the model predictions with the observational 
data. In this case the focus shifts to assessing whether the general 
framework, namely ACDM + the set of physical processes imple- 
mented, gives a plausible description of galaxy populations. 

In this paper we take the second approach. We use three 
SAMs: (i) the most recen t implementation of the Munich model 
jDe Lucia & Blaizotll2007h with its genera lization to the WMAP3 
cosmology discussed in Wang et all 1 2008L hereafter WD L08); (ii) 
the MORGANA model, presented in lMonaco et al.l d2007l) . adapted 
to a WMAP3 cosmology, and wi t h som e minor improvements 
which will b e presented inlLo Farol d2009l) . (iii) the fiducial model 
presented in lSomerville et alj d2008[ hereafter S 08), which builds 
on th e previous implementation discussed in ISomerville et al.l 
d200ll). Al l mode ls adopt, for the results discussed in this study, 



Chabrierl (§003) IMF. 



1 It should be noted that the work by iThomaj \ 19991) was not fully 
self-consistent as they used star formation histories from a semi-analytic 
model in a "closed-box " chemical enrichment model. The later work by 
INagashima et alj fe005l) has, however, confirmed the difficulties in repro- 
ducing the observed trends. 



In the following we briefly summarize the main physical in- 
gredients of SAMs, and then highlight the main differences be- 
tween the implementations of these ingredients in the three models 
used here. For more details we refer to the original papers men- 
tioned above and to the references therein. 

We first summarize the elements that are common to all three 
models. The backbone of all three SAM's is a "merger tree", which 
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describes the formation history of dark matter haloes through merg- 
ers and accretion. When a halo merges with a larger virialized 
halo, it becomes a "sub-halo" and continues to orbit until it is ei- 
ther tidally destroyed or merges with the central object. Gas cools 
and condenses via atomic cooling, and forms a rotationally sup- 
ported disc. This cold disc gas becomes available for star formation, 
which is modeled using simple empirical (Schmidt-Kennicutt-like) 
recipes. Galaxy mergers trigger enhanced "bursts" of star forma- 
tion. After the Universe becomes reionized, gas infall is suppressed 
in low-mass haloes ( 30-50 km/s) due to the photoionizing back- 
ground. Star formation deposits energy into the cold gas, and may 
re-heat or expel this gas. The production of chemical elements by 
type II supernovae is tracked using a simple instantaneous recy- 
cling approximation with the effective yield taken as a free pa- 
rameter. All three codes also track the formation of supermassive 
black holes, and differentiate between the so-called "bright-mode" 
(or "quasar-mode") which is associated with luminous AGNs, and 
"radio-mode" accretion which is related to efficient production 
of radio jets. The "bright-mode" is associated with galaxy-galaxy 
mergers (WDL08 and S08) or Eddington-limited accretion rates 
(MORGANA), while the "radio-mode" is associated with low ac- 
cretion rates (few percent of Eddington). All three models include 
"radio mode" feedback (heating of the hot gas halo by giant radio 
jets). MORGANA and S08 also include galactic scale AGN-driven 
winds, which can remove cold gas from galaxies. 

All three models are coupled with stellar population synthesis 
models and a treatment of dust absorption, and are capable of pre- 
dicting observable quantities like luminosities and colours in vari- 
ous bands. However, modeling t hese additional ingre dients (espe- 
cially dust, as recently shown bv lFontanot et alj|2009h introduces a 
large number of additional uncertainties and degrees of freedom in 
both the model-model and the model-data comparison. To simplify 
the interpretation of our results, we therefore conduct our entire 
analysis in the space of "physical" quantities (e.g. stellar masses 
and SFRs), which are directly predicted by the models, and may be 
extracted from multi-wavelength observations. 

Here we highlight a few of the differences between the model 
implementations : 



• Cosmological and Numerical Parameters: All three 
semi-analytic models adopt values of the cosmological 
parameters that are consistent with the WMAP3 results 
within the quoted errors. WDL08 use a simulation box 
of 125/!. _1 Mpc 3 on a side, with cosmological parameters 
(n ,n A ,h,a 8 ,n sp ) = (0.226,0.774,0.743,0.722,0.947). 
MORGANA uses a 144 h^ 1 Mpc box, and adopts a cosmology with 
(n ,tt A ,h,(j8,n sp ) = (0.24,0.76,0.72,0.8,0.96). S08 uses a 
grid of 100 realizations of 100 "root" haloes, with circular veloc- 
ities rang ing from 60 km/s to 120 km/s, and weights the results 
with the ISheth & Tormenl i ll 9991) halo mass function. The S08 
model used here assumes the following cosmological parameters 
(n ,^A,h,a 8 ,n sp ) = (0.279,0.721,0.701,0.817,0.96). In all 
cases, the mass resolution is sufficient to resolve galaxies with 
stellar mass larger than 10 9 h^ 1 Mq. The very small differences in 
the cosmological parameters in the three models will have a nearly 
undetectable impact on our predictions, and therefore we make no 
attempt to correct the results for the slightly different cosmologies. 

• merger trees: The WDL08 model uses merger trees extracte d 
from a dissipationless N-Body simulation dSpringel et alj|2005l ). 
MORGANA uses the Lagrangian semi-analytic code PINOCCHIO 
dMonaco et alj2002l) . and S08 use a method based on the Extended 



Press- Schechter formalism, described in ISomerville & Kolattl 
dl999h . 

• substructure: The WDL08 model explicitly follows dark 
matter substructures in the N-body simulation until tidal trunca- 
tion and stripp ing reduce their mass below the resol ution limit of 
the simulation foe Lucia et all2004klGao et alj2004h . Beyond that 
point the merger time of the satellite is computed using the clas- 
sical Cha ndrasekhar dynamical fric tion approximation (for mor e 
details see lDe Luci a & Blaizot 2007| and lDe Lucia & Helmil2008l) . 
MORGANA and S08 do not track explicitly dark matter substruc- 
tures and assume satellite galaxies merge onto central galaxies 
after a dynamical friction time-scale which is assigned at the 
time the satell ite enters the virial radius of the remnant struc- 
ture, following Taffoni et alj J2003h in the case of MORGANA and 
iBovlan-Kolchin et alj | |2008T) in the case of S08. The same models 
also account for tidal destruction of satellites. 

• cooling model: WDL08 and S08 use va riations of the orig- 
inal cooling recipe of IWhite & Frenkl dl99ll) . while MORGANA 
us es a modified mo del, described and tested against simulations 
in lViola et alj | |2008|) . that predicts an enhanced cooling rate at the 
onset of cooling flows. 

• galaxy sizes, star formation, and SN feedback: The three 
SAMs also differ in the details of the modeling of star formation, 
stellar feedback and galactic winds, as well as in the computation 
of galaxy sizes. We prefer not to discuss these processes in detail 
here and we refer the reader to the original papers for more details. 

• BH growth and AGN feedback: In the WDL08 and S08 
models, the radio mode is fueled by accretion from the hot gas 
halo, and only haloes that can support a quasi-hydrostatic halo are 
subject to the radio mode h eati ng (though the condition s used dif- 
fer, see lCroton et alj d200d) and lSomerville et all d2008h ). In MOR- 
GANA the radio mode accretion comes from the cold gas reser- 
voir surrounding the black hole. In the WDL08 and S08 mod- 
els, the "bright mode" or quasar mode is explicitly triggered by 
galaxy-galaxy mergers (though again, the details of the implemen- 
tation differ), while in MORGANA it is associated with Eddington- 
limited high accretion rates (again coming from the cold reservoir). 
As noted above, MORGANA and S08 include galactic scale AGN- 
driven winds associated with the bright mode, while WDL08 do 
not. 

The three models were each normalized to fit a subset of 
low-redshift observations. The specific observations used, and the 
weight given to different observations in choosing a favored nor- 
malization are different for each of the three models, and we refer to 
the original papers for details. The most important free parameters 
in all three models are those controlling the efficiency of supernova 
feedback, star formation, and "radio mode" AGN feedback. The ef- 
ficiency of supernova feedback is primarily constrained by the ob- 
served low-mass slope of the stellar mass function (MORGANA and 
S08) or the faint-end slope of the luminosity function (WDL08). 
The efficiency of star formation is mainly constrained by observa- 
tions of gas fractions in nearby spiral galaxies (WDL08 and S08) 
or by the cosmic SFR density (MORGANA). The efficiency of the 
"radio mode" AGN feedback is constrained by the bright or high- 
mass end of the observed LF or SMF. Other important parameters 
are the effective yield of heavy elements, which is constrained by 
the observed mass-metallicity relation at z — 0, and the efficiency 
of black hole growth, which is constrained by the observed z — 
black hole mass vs. bulge mass relationship. We emphasize that 
we made no attempt to tune the models to match each other, or to 
match any of the high redshift data that we now compare with. 
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Figure 1. Galaxy stellar MF as a function of redshift. Solid, dashed, and dot-dashed lines refer to the MORGANA, WDL08, S08 models respectively. In the 
right panels, all model predictions have been convolved with a Gaussian error distribution on log M+ with standard deviation of 0.25 dex. Green symbols 



correspo nd to observationa l measurements from lPanter et alj <2007l SPSS, z = Ol. l Cole et alj feOOlL 2MA SS, z = 0) iBell et"afl <2003l 2MASS+SDSS, 
z = Ol.lBorchet alJliOQol COMB017, 0.2 < z < l).|Perez-Gonzalez et alj l200a Spitzer, < z < 4), foundy et alj feOOd DEEP2. 0.4 < z < 1.4), 
Drorv et alj fe004l M UNICS. 0.4 < z < 1.2 ).lDrorv et alj feOOSl FD F+GOODS, < z < 5). lFontana et alj feOOd GOODS-MUSIC. 0.4 < z < 4), 
Pozzetti et alj J20071 VVDS, 0.05 < z < 2.5). lMarchesini et alj feobll MUSYC+FIRES+GOODS-CDFS, 1.3 < z < 4). All observational measurements 



have been converted to a Chabrier IMF, when necessary. 



3 DOWNSIZING IN STELLAR MASS 

In this section, we focus on the evolution of the galaxy stellar MF. 
Our model predictions are compared with a compilation of pub- 
lished observational estimates using different data-sets and meth- 
ods to compute stellar masses. In the past, the rest-frame near in- 
frared lig ht has been widely used as a tra cer of the galaxy stel- 
lar mass l lCole et alJl200lilBell et alj|2003h. In more recent times, 
most mass estimates jDrorv et alj|2004 120051: iBundv et al.l | 200C ; 
Borch et al.ll2006|;lFontana et al.ll2006|;|Perez-Gonzalez et all20oi ; 



Marchesini et al. 2008) have been based on multi-wavelength Spec- 
tral Energy Distribution (SED) fitting algorithms. In this approach, 
broad-band photometry is compared to a library of synthetic SEDs, 
covering a relatively wide range of possible star formation histories, 
metallicities, and dust attenuation values. A suitable algorithm is 
then used to select the 'best-fit' solution, thus simultaneously deter- 
mining photometric redshift, galaxy stellar mass and SFR. Stellar 
mass estimates are therefore subject to several degeneracies (age, 
metallicity, and dust) and their accuracy depends sensitively on 
the library of star formation histories employ ed, and on the wave- 
length range covered by observations (see e .g. iFontana et alj|2004l ; 
IPozzetti et aL1l2007l ; lMarchesini et alj|2008l) . In particular, most of 
these algorithms assume relatively simple analytic star formation 
histories (with, in some cases, some bursty star formation episodes 
superimposed), while SAMs typically predict much more complex 
star formation histories, with a non-monotonic behavior and erratic 
bursts. This may result in certa in biases in the physical parame- 
ters obtained from this method l lLee et alj[2008h . Similarly, using 
different libraries of star formation histories has an effect on the fi - 
nal mass determination l lPozzetti et al]|2007l ; Istringer et all2009l) . 



Additional sources of uncertainty may come from the physical in- 
gredients in the adopted stellar population models, e.g. to the treat- 
ment of particular stages of stellar evolut ion, such as TP-AGB stars 
dMaraston et alj200d : lTonini et alj2008l) . Moreover, due to the rel- 
atively small volumes probed at high redshift, cosmic variance due 
to large-scale clustering is a significant source of uncertainty, par- 
ticularly in the number density of high-mass objects. 

When high signal-to-noise spectroscopy is available, galaxy 
stellar masses can be estimated by comparison of th e observed 
spectra with theoretical SEDs (e.g. IPanter et alj|2007l for SDSS 
data). In this case the finer details of the spectrum can be used to 
give tighter constraints on, e.g., stellar ages and metallicities. How- 
ever, the method is not free from uncertainties due to model degen- 
eracies, and contamination from AGN and/or strong emission lines 
(usually not included in the theoretical SEDs) can in principle in- 
troduce systematic errors or simply limit the accuracy of the mass 
estimate. 

In Fig. [TJ we show a compilation of different ob- 
servation al measurements of the galaxy stellar MF from 
2MASS l lCole et alj 1200 ll), 2MASS+SDSS jBell et alj 
MUNICS dDrorv et al.ll2004l), FDF+GOODS JDrorv et alj 



COMBQ17 l lBorch et al.1 |2006[). DE EP2 feundv et al 
GOOD S-MUSIC iFontana et aljl2006t). SPSS jPanter et al 



200 



200 



2006 



2007 



VVDS l lPozzetti et alj|2007l) Spitze r dPerez-Gonzalez et alj|2008 
MUSYC+FIRES+GOODS-CDFS dMarchesini et alj2008h (green 
points; left and right panels show the same data). All estimates have 
been converted to a common (Chabrier) IMF when necessary; we 
use a factor of 0.25 dex to convert from Salpeter to Chabrier. These 
stellar mass functions are fairly consistent among themselves, but 
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the scatter becomes larger at higher redshift, in particular for the 
high-mass tail (which is significantly affected by cosmic variance). 

A note on the errors and uncertainties associated with these 
observationally derived stellar mass functions is in order. Most pub- 
lished papers quote only Poisson errors on their mass function esti- 
mates. However, as noted above, both systematic and random errors 
can arise from the unknown true star formation histories, metal- 
licities, and dust corrections, and also from photometric redshift 
errors, differences in stellar population models, t he unknown stel- 
lar IM F and its evolution, and cosmic variance. iMarchesini et al.l 
d2008l) carry out an extensive investigation of the impact of all of 
these sources of uncertainty on their derived stellar mass functions. 
In their Figures 13 and 14 they show a comparison of their results, 
including these comprehensive error estimates, with the three mod- 
els presented here. Their analysis shows that the evidence for dif- 
ferential evolution in the stellar mass function at 2 < 2, with more 
massive galaxies evolving more slowly than less massive ones, be- 
comes weak when all sources of uncertainty in the stellar mass es- 
timates are considered. When this is done, the observed evolution 
appears to be consistent with pure density evolution. 

In order to make DS in stellar mass more evident, we divide 
galaxies in bins of stellar mass and, by averaging over the MF es- 
timates of Fig Q] compute the stellar mass density of galaxies in 
these stellar mass bins as a function of reds hift. These stellar mass 
den sities agree with estimat es published bv lConselice et al.l d2007l) 
and lCowie & Bargeil d2008l) and are shown in Fig.|2](left and right 
panels contain the same data). The quoted errors refer only to the 
scatter between the estimates from different samples (note that this 
scatter is larger than the quoted errors on individual determina- 
tions, confirming that these errors are underestimated, as discussed 
above). 

DS in stellar mass should consist of a differential growth of 
stellar mass density, such that massive galaxies are assembled ear- 
lier and more rapidly than low mass galaxies. Examining Fig. [2] 
it is hard to claim convincing evidence for such a behavior from 
these data: although the evidence for the growth of stellar mass 
density is clear, its rate of growth is very similar for all mass bins. 
To further illustrate this point we perform a linear regression of the 
Log/9* — z relation in each mass bin; the slopes we obtain are con- 
sistent within their statistical errors. We then rescale the densities 
to the z — value of their regressions and fit the whole sample, 
obtaining the thick dotted line in the left panels of Fig. [2] The fit, 
valid in the 10 9 < Af*/M© < 10 12 range, has a x 2 probability of 
> 95%). 

The predicted stellar MFs for the MORGANA, WDL08 and S08 
models (solid, dashed, and dot-dashed lines respectively) are shown 
in the left panels of Fig.Q]and[2] while in the right panels of the two 
figures, the model stellar masses have been convolved with a statis- 
tical error on log M*. As we have discussed above, this error distri- 
bution depends on many factors, such as the specific algorithm and 
stellar population models used to estimate stellar masses for each 
sample, the magnitude and redshift of the galaxy, and characteris- 
tics of each observational survey, such as the volume covered and 
the number and wavelength coverage of the photometric bands that 
are available. A detailed accounting of this complex error distribu- 
tion for each observational dataset is clearly beyond the scope of 
this paper. Rather than simply ignoring the impact of errors in the 
stellar mass estimates on our data-model comparison, as has usu- 
ally been done in the past, we adopt a simple approach that is meant 
to be illustrative rather than definitive. We assume that the error has 
a Gaussian distribution (independent of mass and redshift) with a 
standard deviation of 0.25 dex. This assumed uncertainty roughly 



corresponds to the mean value of the formal error in t he stellar mass 
determ ination from the GOODS-MUSIC catalogue dFontana et al .1 
|2006|. their Figure 2), i s lower (by about 0. 1 dex) than that estimated 
by Bundv et alj d2006l) . and is roughly consistent with the findings 
of lStringeretalJd2009l) . 

The first thing to note is that the models give fairly consis- 
tent predictions. Secondly, as redshift increases, the intrinsic model 
predictions (i.e without convolution with errors) show a significant 
deficit of massive galaxies (the two bins 10 11 < M*/M Q < 10 11 ' 5 
and M* > 10 Mq) with respect to the data. The error convo- 
lution does not affect the power-law part of the MF, but it has a 
sig nificant impac t upon its high mass tail, as already pointed out 



Baughl J2006 ) iKitzbichler & White] | |2007|) . and more recently 
String er et al. I J2009I) . Because the models were tuned to match 



by 
by 

the z = stellar MF or LF without errors, this convolution causes 
a small apparent overestimate of the number of the most massive 
galaxies at 2 ~ 0. This could be corrected by tuning the radio mode 
AGN feedback in the models. However, there are indications that 
the observed magnitudes and stellar masses of the brightest local 
galaxies may be underestimated by significant amounts (see the dis- 
cussion in S08). Therefore we do not re-tune the models to correct 
this apparent discrepancy. When these observational uncertainties 
are taken into account, model predictions for massive galaxies are 
in fairly good agreement with observations over the entire redshift 
interval probed by the surveys that we considered (with MORGANA 
being ~ 2a low at z > 2). 

In lower stellar mass bins (10 10,5 < Af*/M© < 10 11 and 
in particular 10 10 < M*/M© < 10 105 ), all three models over- 
predict the observed stellar MF and stellar mass density at high 
redshift (z ^ 0.5), with the discrepancy increasing with increas- 
ing redshift. Thus, a robust prediction of the models seems to be 
that the evolution of less massive galaxies is slower than that of 
more massive ones — i.e., the models do not predict stellar mass 
"downsizing" but rather the opposite behavior (sometimes called 
"upsizi ng"). This discrepancy has already been n oted in previous 
papers dFontana et al.ll2004l ; iFontanot et alj|2007t) . and extends to 
other models. Therefore, the models seem to be discrepant with 
observations even if the real universe shows mass-independent den- 
sity evolution rather than downsizing. As a caveat, it is worth men- 
tioning that, although the different groups have performed detailed 
completeness corrections, it is still possible that the high redshift 
samples may be incomplete at the lowest stellar masses. In Fig. Q] 
and|2]we show data only for mass ranges where the corresponding 
authors claim that no completeness correction is necessary. 

In order to further investigate the evolution of the predicted 
galaxy SMF, we consider separately the contribution from central 
and satellite galaxies at different redshifts. Model predictions (con- 
volved with observational uncertainties as before) are shown in the 
left panels of Fig. [3] As a reference, in each panel we show the 
total observed MFs at the considered redshift. The three models 
predict a similar evolution for the two subpopulations. It is evident 
that central galaxies are the main contributors to the overprediction 
of low-mass galaxies at 2 < 2: the models predicts a roughly con- 
stant 2 < 2 number density of low-mass central galaxies, while 
the low-mass satellite population shows a gradual increase which 
is due to the infall of field galaxies into galaxy groups and clusters. 
This implies that small objects are over-produced while they are 
central galaxies, and the excess is not primarily due to inaccuracies 
in the modeling of satellites. 

In the right panels of Fig. [3] we show the evolution of the 
galaxy SMF split in bins of parent halo mass at the considered red- 
shift. Again, the three SAMs predict similar trends: it is evident that 
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Figure 3. Left panel: solid and dashed lines refer to model galaxy SMFs for central and satellite galaxies respectively. Right panel: model galaxy MFs in 
bins of parent DM halo mass. Different models are shown in different columns, as labeled. In both panels, observational measurements of the global SMFs at 
different redshifts are shown as in fig.[T] 




1 2 



Figure 2. Stellar mass density in bins of galaxy stellar mass, as a function of 
redshift. Blue asterisks are estimated from available observational estimates 
of the galaxy stellar MF (see text for details). The thick red line in the left 
panels shows the expected evolution in a pure density evolution scenario. 
Solid, dashed, and dot-dashed lines refer to the MORGANA, WDL08, and 
S08 models respectively. In the right panels, the model predictions have 
been convolved with a Gaussian error on log M* with a standard deviation 
of 0.25 dex. 



the main contributors at all redshifts to the low-mass end excess are 
low-mass (10 11 < M h /M Q < 10 12 ) dark matter halos. These re- 
sults suggest that, in order to cure the discrepancies seen in these 
three models and others, we should seek a physical process that can 
suppress star formation in central galaxies hosted by intermediate 
to low-mass haloes (M h /M e < 10 12 ). 



4 DOWNSIZING IN STAR FORMATION RATE 

The downsizing in galaxy SFR seen in lookback studies most 
closely corresponds to the original definition of DS. There are, 
however, several different forms in which the diagnostics of DS 
in SFR may been cast observationally. In addition, one can iden- 
tify two different trends that might be called "downsizing": first, 
the normalization of the "star forming sequence" of galaxies shifts 
downwards with decreasing redshift; second, galaxies move off the 
star forming sequence and become 'quenched' or passive as time 
progresses. These different behaviors may offer clues to the phys- 
ical mechanisms responsible, e.g., the downward shift of the SF 
sequence might be due to simple gas exhaustion by star formation, 
while "quenching" is presumably due to a more dramatic process 
such as AGN feedback. If downsizing is occuring, this evolution 
should happen in a differential way, with more massive objects be- 
ing quenched earlier and/or more rapidly. In order to probe these 
different possible "paths" for downsizing, we will consider several 
different ways of slicing and plotting the distribution function of 
SFR as a function of stellar mass and redshift: 1) the 2-d distribu- 
tion of stellar mass and SFR in several redshift bins (2) the average 
SFR as a function of stellar mass, plotted in redshift bins (3) the 
SFR density contributed by objects of different stellar masses, as a 
function of redshift (4) the evolution of the stellar mass function of 
active vs. passive galaxies. 
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Figure 4. The 2-dimensional distribution of specific star formation rates versus stellar mass for the three different SAMs at different redshi fts, and for a compi- 
lation of observations. Mod el ga laxies with SSFR < 10~ 13 yr _1 have been assigned SSFR = 10~ 13 yr _1 . Observational data are fromlschiminovic h et alj 
Noes ke et al ] l2007l) and fSantini et aljj2009h . Blue, red and green lines indicate the observed z = "star-forming" sequence, the "quenched" sequence 
and the "green valley", respectively. 



Star formation rates are estimated using different observa- 
tional tracers, such as H-q emission lines, UV, mid- and far-IR 
emission, and radio. SFR may also be estimated by fitting SF his- 
tories to multi-wavelength broadband SED's, in a similar manner 
used to estimated stellar masses. SFR estimates are impacted by 
many of the same sources of uncertainty as stellar mass estimates 
(such as propagated errors from photometric redshift uncertainties 
and sensitivity to the assumed stellar population models, stellar 
initial mass function, and star formation histories), and also each 
tracer carries its own set of potential problems. For example, SFR 
estimates based on emission lines such as H-a are metallicity de- 



pendent and (typically fairly large) corrections for dust extinction 
must be applied. A potential advantage to this approach is that 
dust corrections can be fairly reliably estimated from the Balmer 
decrement; however, these measurements are currently impractical 
at high redshift as they would require highly multiplexed, deep NIR 
spectroscopy. SFR estimates based on the UV continuum alone suf- 
fer mainly from the very large and uncertain dust corrections (ex- 
tinction estimates based on the UV spectral slope, while widely 
used, are quite uncertain). Estimates based on the mid-IR (e.g. 24/i) 
suffer from highly uncertain k-corrections (as strong PAH features 
move through the observed bandpass), potential strong contamina- 
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Figure 5. Ave rage SFR of galaxie s in bins of stellar mass and redshift. Data are from iDro rv & Al vare J 12008L red open sq uares'!. iBell et alj 12007L black 
filled squares). iNoeske et aTM2007|, magenta open tri angles), [chen et al. 1 2009, red crosses^Martin e t al.1 j2007l blue asterisks). lElbaz et alj j2007l cyan stars), 
iDaddi et alj <2007t blue crosses), Dunne et al. (2008, filled and open blue diamonds), and lSantini et alj j2009l green filled circles, open circles indicate upper 
limits). Errorbars are shown where provided by the authors. The shaded area represents the confidence region for the lowest redshift bin. Solid, dashed and 
dot-dashed lines refer to the MORGANA, WDL08 and S08 models respectively. Model predictions have been convolved with the errors on stellar mass and 
SFR. Top row: all model galaxies are included in the average; second through fourth rows: Only active (SSFR > 10~ n yr _1 ) model galaxies have been 
included in the average. The results for active and all galaxies are nearly indistinguishable for the high redshift (z ^ 1) bins, which is why we do not show 
both cases. 



tion by AGN, uncertainties in the IR SED templates (due to our lack 
of knowledge about the composition and state of the emitting dust), 
and possibly contamination by heating from old stellar populations. 
Measurements of the longer wavelength thermal IR, near the peak 
of the dust emission (~ lOO/i) offer perhaps the most promising 
approach to obtaining robust estimates of total SFR. These are, 
however, currently available only for a small number of very IR- 
luminous galaxies. Moreover, all these indicators are usually cali- 



brated on local galaxy samples, and the systematics connected with 
applying them to higher redshift are poorly known. 

The observed SFRs used in this Section have been ob- 
tained from UV + Spit zer 24»m fe ell et alj 1 20071 : IZheng et all 
120071). Spitzer 24/im dConselice et alj 12007V) . GALEX FUV 
i Schiminovich et all |2007|K emission lines + Spitzer 24/xm 
i Noeskeetal] l2007t). SED-fitting continuum at 2800 A 
i Drorv & Alvarez! 120081 : iMobasher et ail |2009t) , GALEX FUV 



10 Fontanot et al. 



+ Spitzer 24/im dMartin et al.l l2007h . Balmer abs orption lines 
dChen et al.l |2009|). SED-fitting + Spitzer 24^m dSantini et al] 
and radio founne et al . 2008). 

The comparison of models and data is also made difficult by 
the complex selection criteria involved. Most SFR estimates used 
here have poor sensitivity to sources with low SFRs, leading to 
many upper limits; for instance, SFR estimates for passive objects 
are poorly constrained by SED fitting techniques. Several authors 
have attempted to correct for incompleteness by stacking images 
of objects with similar masses to obtain deeper detections, or by 
using only galaxies with active star formation to compute the av- 
erage. A proper comparison should take into account the selection 
effects of each dataset; however, systematics are large and poorly 
understood, so a detailed comparison at this stage is of doubtful 
utility. With all these caveats in mind, we compare our models to 
the data at face value, trying again to assess whether DS is seen in 
the data and to what extent models are consistent with available ob- 
servations. Moreover, analogously with stellar masses, we convolve 
model SFRs with a log-normal error distribution; for its amplitude 
we use a value of 0.3 dex, roughly equal to the median formal error 
of SFRs in GOODS-MUSIC. In the light of what said above, this 
estimate is clearly naive, but it allows us to determine the gross ef- 
fect of (random) uncertainties in SFR determinations. We find that 
our results are fairly insensitive to the inclusion of this error. 

In Fig.|4]we show the 2-d distribution of specific SFR (SSFR) 
as a function of stellar mass, for several redshifts from z = 2 to 
0, for all three models and for a compilation of observational dat a 
dSchiminovich et al.ll2007l ; lNoeske et alj2007l ; [Santini et alj|2009h . 
All model galaxies with SSFR< 10 -13 yr _1 have been assigned 
SSFR = 10~ 13 yr -1 (this causes the thin quenched sequence at 
the bottom of each panel). In each panel, we plot the locations 
of the "star forming" and "quenched" sequenc es at z ~ from 
local observations based on SDSS+GALEX dSalim et al] |2007| ; 
ISchiminovich et aT]|2007l) . and of the so-called "green valley" that 
divides the two sequences. We see that all three models show quali- 
tatively similar behavior. Perhaps the clearest discrepancy between 
the models and data is that the SSFR of low-mass galaxies in all 
three models are the same as or, in the case of MORGANA, even 
lower than those of massive galaxies, while in the observations a 
clear trend is seen such that lower mass galaxies have higher SSFR. 
In the models, the slope of the SF sequence does not appear to 
change significantly over time between z ~ 2 and z ~ 0, while 
the normalization of this sequence decreases over time. Also in all 
three models, there are few if any massive passive galaxies in place 
at z ~ 2; it remains to be seen whether this is in conflict with 
observations. 

In Figure [5] we show the evolution of the average SFR of 
galaxies as a function of stellar mass, for e ight redsh i ft bin s 
from z ~ . 3 to ~ 3.5; data are taken from iBell et alj 



Noeske et al]d2007l).lMartin et al ld2007l) . lDrorv & Alvarez! 
Chen et alj d2009l). ISantini et al. I (12003) and iDunne et al.|j2008l) . 



2007) 



2008) 



For the GOODS-MUSIC data dSantini et al]|2009l) we show with 
open (filled) circles the bins where, according to the authors, the 
incompleteness is (is not) significant; open symbols are then upper 
limits to the average SFR. In order to illustrate the redshift evo- 
lution of this quantity, in all panels the shaded cyan/yellow area 
represents the confidence region of the z ~ observations. In 
the top row, we show redshifts 0.3-0.7, and show model results 
in which we average over all galaxies. In the second row, we re- 
peat the redshift bins 0.3, 0.5, and 0.7 but this time include in the 
model averages only star-forming galaxies (defined here as having 



SSFR> 10" 



yr 



). The remaining panels show model averages 



for active galaxies only, for higher redshifts 1 < z < 3.5. We only 
show the low redshift bins for both active and all galaxies because 
it is only at these redshifts that there are any significant difference 
in the results. We can see, however, that at low redshift, the inclu- 
sion of passive galaxies causes a turn-over in the average SFR at 
high masses in the WDL08 and S08 models. 

We first note again the good agreement between the results 
of the three different SAMs seen in figure [5] a result that we 
did not necessarily expect given the different implementations of 
star formation and feedback. Regarding the comparison with ob- 
servations, we find that the average SFR of low-mass galaxies 
(M* £ 10 'Mq) is underestimated by the models at all redshifts, 
as we already noted from Figure [4] The average star formation 
rates for massive galaxies generally lie near the middle of the 
range of different observational estimates at low redshift, and near 
the lower envelope of observation al estimates at higher redshift 
(z ^ 2). Several previou s studies l lElbaz et al] |2007| ; iDaddi et"al] 
120071 : ISantini et al.ll2009h compared the predictions of a slightly 
different version of the WDL08 models with a s ingle observationa l 
estimate of the SFR as a function of stellar mass. lElbaz et al.ld2007l) 
found that the model predictions were lower than th eir observa- 
tional esti mates at z ~ 1 by abo ut a factor of two, while lDaddi et al] 
d2007l) and lSantini etafl d2009l) found that the models were low by 
a factor of ~ 5 at z ~ 2. Our results are entirely consistent with 
their findings, but we also see that (as already noted above) the dis- 
persion in different observational estimates of the average SFR at 
fixed stellar mass is as large, or larger, than the discrepancy between 
the model predictions and the observational estimates of these pre- 
vious studies. 

MORGANA produces too few massive, passive galaxies at late 
times, resulting in an overestimate of the SFR of massive objects a t 
low redshift. This was studied in more detail in lKimm et al ] d2008l) , 
and is due to a less efficient, or delayed, quenching of the cooling 
flows in massive haloes via radio mode feedback. 

Similar conclusions can be reached by considering the SFR 
density, as a function of redshift, contributed by galaxies of differ- 
ent stellar mass (Fig. [6}. We used the K-ba nd selected GOODS - 
MUSIC catalogue, complete to K < 23.5 dGrazian et al. lEH), 
to compute the SFR density as a function of stellar mass. Follow- 
ing the discussion in lFontana et al] d200fj| ). we translated the mag- 
nitude limit into a stellar mass limit, and computed SFRs either 
with SED fitting using photometry from the NUV to the mid-IR, or 
with Spitzer 24/im fluxes when available. We also plot several other 
data sets from the literature : local points fr om SDSS+GALEX from 
ISchiminovich et al] d2007h ; the results of IConselice et al] d2007l ). 
based on the Palomar/DEEP2 Survey; estimates from st acked 24 
^m flux from the COMBO- 17 sur vey feheng et alif2007h : and es- 
timates from UV luminosity alone dMobasher et alj2009f) . 

The three models again give fairly consistent results, although 
the predictions diverge in the higher mass bins. All three models 
show a gentle decline in the SFR density for low mass galaxies, 
and if anything a somewhat flatter behavior for the SFR density 
in massive galaxies. This time the model predictions agree well 
with the observations for small mass galaxies, because the higher 
number of small galaxies compensates for the lower SSFR of the 
objects. 

Another way to characterize DS in star formation is by di- 
viding galaxies into active (blue) and passive (red) populations, 
then computing the two stellar MFs or, alternatively, the i f -band 
luminosity functions. As p ointed out bylBorch et al] d2006l) . using 
the COMB017 sample, and lBundv etaU d2006l) . using DEEP2, the 
two mass functions cross at a characteristic mass which arows with 
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Figure 6. SFR density contributed by galaxies of different stellar mass 
(stell ar mass is measur ed at the redshift of t he plotted points). Dat a are 
from lZheng et alj 12007} blue fille d diamonds) ; lMobasher et alj<2009l cyan 
stars); IScMrnTnovich et alj J20071 magenta open circles); IConselice et alj 
red open squares) and from GOODS-MUSIC (green open and 
filled circles). Solid, dashed and dot-dashed lines refer to the MORGANA, 
WDL08, and S08 models respectively. Model predictions have been con- 
volved with errors in stellar mass and SFR as explained in the text. Note 
that the lower panels do not represent a mass sequence, in order to compare 
model predictions with observational determinations. 



redshift. Instead of using the color criterion we divide our sam- 
ple into passive and a ctive galaxies using a thr eshold value for the 
SSFR of KT 11 yr" 1 dBrinchmann et al]2004l) . Figure[7]shows the 
evolution of the stellar MFs of active and passive galaxies as pre- 
dicted by the three models. The MF of active galaxies show almost 
no evolution since z ~ 2, whereas most of the evolution of the 
MF is due to the build-up of the passive population; this is quali- 
tatively consistent with the observational resu lts. However, obser- 
vations jBundv et alJl2006l ; lBorch et alj|2006h show that the stellar 
MF of red (passive) galaxies peaks at ~ lO^M© and decreases 
at lower masses. In other words, in observed samples, low mass 
galaxies are predominantly blue (active), while in our models the 
low-mass slope of the SMF is nearly the same for active and pas- 
sive galaxies. This result still holds when galaxies are divided using 
colours rather than SSFR, and this marks another discrepancy be- 
tween models and data for small galaxies. 

In Fig. [8] we show the stellar mass-weighted integrals of these 
functions, i.e., the stellar mass density contained in the active and 
passive populations, as a function of stellar mass and redshift. In 
all models, the stellar mass density (SMD) is dominated by actively 
star forming galaxies at high redshift, with the SMD contributed by 
passive objects growing rapidly at z 1. These results a re in quali- 
tative agreement with observational results at z ;$ 1 (e.g. lBell et alj 
1 2007b . Observational results at higher redshift will soon be avail- 
able from ongoing and future surveys. 

Up until this point in this Section, we have discussed the 
model-data comparison without assessing whether either the pre- 



Figure 7. Stellar MFs of active (SSFR> 10" 11 yr" 1 , upper panels) and 
passive (SSFR< 10~ n yr - lower panels) galaxies. Solid, dashed, dot- 
dashed and long-short dashed lines refer to z = 0, 0.4 < z < 0.6, 1.0 < 
z < 1.3 and 1.6 < z < 2.0, respectively. Different columns show the 
three different models, as labeled. 



MORGANA 



WDL08 



S08 




Figure 8. Stellar mass density of active (SSFR> 10" 11 yr -1 , blue solid 
lines) and passive (SSFR< 10" 11 yr -1 , red dashed lines) galaxies in bins 
of galaxy stellar mass, as a function of redshift. Model predictions have 
been convolved with a Gaussian error on log M* with a standard deviation 
of 0.25 dex. 
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Figure 9. Archaeological DS. Observational constraints on the mean mass- 
weighted stellar age at z = as a function of stellar mass. Solid, dashed and 
dot-dashe d lines refer to t he MO RGANA, WDL08, S08 models respectively. 
Data from lGallazzi et alj fe005l) . 



dieted or observed behavior constitutes "downsizing". The DS-like 
differential evolution would be seen as an earlier accumulation of 
massive passive galaxies in figure[4] and as a flattening of the slope 
of the stellar mass-SFR relation in figure[5]with increasing time. In 
both figures[4]and[5]we see a clear downward shift over time of the 
star-forming sequence both in the observations and in the models. 
Given, however, the significant discrepancies seen between differ- 
ent datasets and different SF indicators, and the possible incom- 
pleteness of the observations of low-mass passive galaxies at high 
redshift, we feel that it is difficult to claim that there is currently 
robust evidence for this differential evolution (DS) in the data in 
either Figure. Once again, however, the models if anything show a 
reverse downsizing trend, with passive low-mass galaxies appear- 
ing earlier than massive, passive galaxies. In Figure|6] the signature 
of DS would be a more rapid drop, with decreasing redshift, of the 
SFR density for more massive galaxies. Alth ough some previou s 
studies have claimed to see such an effect (e.g. ljuneau et alj|2005l) . 
the observational compilation that we have shown here does not 
show clear evidence for this differential decline. The one bin in 
which a mar kedly sharp decline is seen (the highest mass bin, see 
discussion in lSantiniet"ai]|2009h may be affected by cosmic vari- 
ance. Once again, the data appear to be consistent with a constant 
rate of decline in SFR density for galaxies of all masses. 



5 ARCHAEOLOGICAL DOWNSIZING 

In this section, we focus on the relation between the z — galaxy 
stellar mass and the average age of the stellar population (the ar- 
chaeological DS discussed in Section [JJ. In Figure [9] we com- 
pare the stellar mass-weighted age of the stellar populations in 
galaxies as a function of their stellar mass (at z — 0) as pre- 
dicted by the three models with the observational estimates from 



iGallazzi et al.1 J2005h . They use high-resolution SDSS spectra to 
obtain estimates for the ages and metallicities of ~ 170, 000 galax- 
ies with Mi, > 1O 9 M0. They measure these by comparing a set of 
absorption features in the spectra (in particular the Lick indices and 
the 4000 A break) to a grid of synthetic SEDs covering a wide range 
of plausible star formation histories and metallicities. Both the cho- 
sen star formation histories and stellar population synthesis codes 
adopted are a likely source of systematic uncertainty in these esti- 
mates. Moreover, corrections must be made for in- filling by emis- 
sion l ines in the age-sensitive spectral features (see IGallazzi et al] 
120051 for a complete discussion on how this correction was applied. 

Our results show that the model massive galaxies are old, in 
agreement with the observations. However, two out of three mod- 
els predict only a mild trend in age from high mass to low mass 
galaxies, in conflict with the steeper trend seen in the observational 
estimates (as already pointed out by S08). MORGANA behaves like 
models without AGN feedback, which produce an inverted trend 
(in which massive gala xies are younger than low-mass galaxies); 
ICroton et al] ( l2006l) and lDe Lucia et al]j2006h showed that includ- 
ing the "radio mode" AGN feedback makes the massive galax- 
ies older, improving the agreement with the observed trend. Once 
again, it is low-mass galaxies that are discrepant, in the sense that 
they form too early and thus have ages that are too old. 

The inverted trend predicted by MORGANA is mainly due to 
two different physical processes. The younger ages of massive 
galaxies are related to the inefficient quenchi ng of cooling flow s 
in massive halos at z < 1 (see the discussion in lKimm et alj2008h . 
The resulting higher level of star formation implies younger ages 
with respect to WDL08 and S08. The older ages of intermediate- 
to-low mass sources are likely due to th e enhanced cooling at high- 
redshift discussed in IViola et al] f2008l) . and to the associated en- 
hanced star-formation at early times. 

We note that the observational estimates are closer to be- 
ing luminosity - weigh ted more than stellar mass-weighted — 
|Pe Lucia et al] d2006t) showed that light- weighted ages show a 
stronger trend with stellar mass — and also that the ages based on 
absorption line indices (mainly Balmer lines) tend to actually re- 
flect the age of the most rece nt star formation episode, rather than 
the luminosity-weighted age dTrager et al.l2000bl , l2008h . However, 
iTrager & Somerville] ]2008l) find that when these observational bi- 
ases are modeled by extracting line strengths for the SAM galaxies 
in the same way as is done for the observations, this effect cannot 
fully account for the discrepancy between the model ages and the 
observed ages for low-mass galaxies. 

One can even go a step further and att empt to extract sta r for- 
mation histories from galaxy spectra (e.g. IPanter et al .Il2007f ) and 
to construct an average SF history for galaxies binned in terms of 
their ste llar mass at z = . We compare our models with the re- 
sults of IPanter etall d2007l) . who applied the MOPED algorithm 
to high resolution (3 A) spectra from the SDSS. This algorithm 
is similar in spirit to the SED-fitting we described in sec. [3] but 
it treats the SFR as a free parameter (defined on an 11 -bin grid), 
thus allowing for the reconstruction of the star formation history of 
galaxies. We stress that th e se me asurements come with numerous 
uncertainties. IPanter et"aT, I d2007t) showed that their reconstructed 
SF histories depend strongly on the input assumptions. In partic- 
ular, they demonstrated that the largest systematics are related to 
the chosen spectrophotometric code, stellar population model, the 
assumed IMF, the dust attenuation prescription, and the calibration 
of the observed spectra. 

These observationally derived SF histories are shown in 
Fig. [TO] where we plot the cosmic SFR density in bins of z — 
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Figure 10. Cosmic SFR density in bins of z = stellar mas s, as a func- 
tion of redshift. Red squares show observational estimates bv lPanter et alj 
120071) . Solid and dashed lines refer to the MORGANA and WDL08 models 
respectively (uncertainties on stellar mass and SFR have been included). 

stellar mass as a function of redshift. In each panel we renormal- 
ize both the data and the model predictions to the observed value 
at 2 = 0.0844 in order to highlight the differences in the shapes. 
For technical reasons, we cannot easily extract the star formation 
histories for galaxies selected by present-day stellar mass from the 
S08 models. We therefore limit this final comparison to the two 
other models. Fig. [TO] shows that both SAMs considered here fail 
to reproduce the observed trend in detail. Small galaxies form too 
large a fraction of their stars (compared to the observational esti- 
mate) at high redshift. For more massive galaxies, the SFR density 
evolution seems fairly well reproduced by MORGANA, while in the 
WDL08 model too few stars are produced in massive galaxies at 
low redshift. 



6 DISCUSSION & CONCLUSIONS 

We have presented a systematic comparison of semi-analytic mod- 
els (SAMs) of galaxy formation with observations of local and 
high-redshift galaxies that have been claimed to show so-called 
"downsizing" trends. We had several goals: (i) to re-assess the ro- 
bustness of the claims of observed downsizing in the literature, 
based on an extensive comparison of different observational data- 
sets ii) to see if a consistent picture is painted by the different ob- 
servational "manifestations" of downsizing iii) to test to what ex- 
tent the predictions of hierarchical models of galaxy formation, set 
within the ACDM framework, are consistent with these observa- 
tional results. 

In order to test the general paradigm of galaxy formation 
within the hierarchical picture rather than a specific model im- 
plementation, we considered predictions from three independently 
developed SAMs (WDL08, MORGANA and S08). We used physi- 
cal quantities (stellar masses and SFRs) derived from observations 



to avoid confusion related to differences arising from the spectro- 
photometric codes and dust models used by the three SAMs. Of 
course, we cannot avoid these issues since the observational esti- 
mates of stellar masses and SFRs still depend on stellar popula- 
tion models and contain assumptions about dust content, metallic- 
ity, star formation history, and IMF. 

Despite significant differences in the recipes adopted in the 
three models to describe the physical processes acting on the bary- 
onic component, the predictions are remarkably consistent both for 
the evolution of the stellar mass and for the star formation history. 
This is encouraging, in that it suggests that our results are relatively 
robust to the details of the model assumptions. 

We summarize our findings in terms of the three different 
manifestations of downsizing that we considered here. We remind 
the reader that, in all cases, the signature of downsizing is that mas- 
sive galaxies formed (or were assembled) earlier and more rapidly 
than lower-mass galaxies. 

DS in stellar mass: (i) We do not see robust evidence for dif- 
ferential evolution of the stellar mass assembly in the observations, 
i.e., the data are consistent with an increase in stellar mass density 
at the same rate for all stellar mass bins, (ii) We find that the mod- 
els roughly reproduce the evolution of the space density of massive 
galaxies when their predictions are convolved with a realistic es- 
timate for the observational error on stellar masses. At the same 
time, all models predict almost no evolution in the number density 
of galaxies of mass ~ 10 10 Mq since z ~ 2, at variance with 
real galaxies whose number density evolves by a factor of ~ 6 in 
the same redshift interval. Put another way, the models (which are 
normalized to reproduce the stellar mass function at z = 0) over- 
produce low-mass galaxies relative to observations at high redshift 
(z > 0.5). 

DS in SFR: (i) We find that different estimates of SFR as a 
function of stellar mass from different methods show large system- 
atic offsets as well as differences in slope. Based on the available 
observational compilation, we do not see conclusive evidence for 
differential evolution of the SFR or SFR density for galaxies of dif- 
ferent mass. It may therefore be premature to reach any firm conclu- 
sions about whether these observations in fact show the signatures 
of DS. (ii) The models roughly reproduce the increase of the aver- 
age SSFR and SFR density of galaxies up to z ~ 4 though with a 
possible systematic underestimate, the weak evolution of the stellar 
MF of actively star-forming galaxies and the build-up of the pop- 
ulation of passive galaxies at z < 2. However, the mass function 
of passive galaxies has a much steeper small-mass-end slope than 
the data, and low-mass galaxies are too passive (have too little star 
formation) at all probed redshifts. 

Archaeological DS: (i) The data do clearly show the trend of 
massive galaxies being older than low-mass galaxies. However, this 
trend may arise in part from biases related to the SFR reconstruc- 
tion algorithms, (ii) The WDL08 and S08 SAMs qualitatively re- 
produce the observed trend, in that low-mass galaxies are younger 
than high-mass ones. However, the slope of the mean stellar pop- 
ulation age vs. stellar mass trend is much shallower in the models 
than in the data. Some, though probably not all, of this discrepancy 
may be related to observational biases in the age estimates. The 
SAMs do not agree well with the detailed star formation histories 
as a function of z = stellar mass extracted from galaxy spectra; 
low mass galaxies form too large a fraction of their stars at early 
times, and high mass galaxies (at least in the WDL08 models) do 
not have enough star formation at late times. 

Massive galaxies have long been considered one of the main 
challenges for hierarchical models. The introduction of so-called 
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"radio mode" AGN feedback helps keep massive galaxies from 
forming stars down to z = 0, so that red and old massive galax- 
ies are now produced by the latest generation of SA Ms. We find 
that when the stell a r mass errors are accounte d for teorch et al.l 
12004 lBaugr]|2006l : iKitzbichler & White! |2007|) . discrepancies in 
the number densities of massive galaxies weaken or disappear. A 
number of problems still affect model predictions for the most mas- 
sive galaxies: according to the results shown above, their evolution 
since z ~ 1, which is driven by mergers, is marginally inconsistent 
with the data (Figure|2](. Models may also underestimate the num - 
ber of massive galaxies at z > 2 (see also lMarchesini et alj|2008h . 
Depending in part on which observational estimates turn out to be 
correct, at least in some of the models the SFR in massive galaxies 
at high redshift may be too low. These residual discrepancies may 
be solved by better modeling the known processes: the implemen- 
tation of AGN feedback is still extremely crude. The merger-driven 
evolution at z < 1 may be slowed down by scattering of stars 
into the diffuse stellar component of ga l axy groups and cluster s 
dMonaco etal]|2006r . IConrov et al]|2007l ; ISomerville etai]|2008h . 
Moreover, better modeling of chemical evolution is needed to ad- 
dress what may be the most severe challenge for massive galaxies, 
the chemo-archaeological DS. 

At the same time, we find serious discrepancies in the model 
predictions for less massive galaxies in the range 10 9 — 10 11 Mq 
in stellar mass: they form too early and have too little ongoing 
star formation at later times, so their stellar populations are too 
old at z = 0. Their number density is nearly constant since 
2 ~ 2, while observations show that it grows in time. Their SSFR 
is too low compared with observational data. The low-mass end 
slope of the SMF of passive galaxies is too steep, again indicating 
an excess of low-mass passive galaxies. Part of this discrepancy 
cou ld be due to the over-quenching problem for sa t ellite galax- 
ies IWeinmann et aljfeoodllvan den Bosch et~al]|2008l : rKimm et al.l 
l2008l : lGilbank & Baloghl2008h . which is caused by the assumption 
in all three SAMs that the hot halo is instantly stripped from satel- 
lites as they enter a larger host halo, thus shutting off any further 
cooling onto satellite galaxies. However, as we showed in Figure[3] 
the problematic galaxies are predominantly central galaxies in DM 
haloes with relatively high circular velocities, ~ 100 — 200 km/s. 
Therefore, mechanisms that only impact satellite galaxies (such as 
ram pressure stripping), or that only work on very low mass haloes 
(like photo-ionization or, probably, pre-heating) are not viable so- 
lutions to this problem. 

The paradox is that we must suppress the formation of low- 
mass galaxies in order to fit the low-mass end of the stellar mass 
function or the faint end of the luminosity function within the CDM 
paradigm. In the three models presented here, as in probably all 
ACDM models in the literature, this is currently accomplished by 
implementing very strong supernova feedback in low-mass galax- 
ies. Not only is it unclear that this strong SN feedback is physi- 
cally motivated or in agreement with direct observations of winds 
in low-mass galaxies, but apparently it does not produce the correct 

formation histories for low-mass galaxies. 

Another hin t may come fr om chemical DS: iMaiolino et al.l 

d2008l) (see also ILo Fard 120091) showed that the models predict 
that small galaxies at high redshift are much more metal-rich than 
observed galaxies at these mass scales. This could either indicate 
that the metals are efficiently removed from these galaxies, e.g. by 
winds, or that star formation (and therefore metal production) is 
inefficient. 

Thinking of a plausible mechanism that can suppress the for- 
mation of galaxies in small but compact DM haloes at high z is 



not so easy: their density is too high and their potential wells are 
too deep to suppress star formation with heating from an external 
UV background, while massive galactic winds should not destroy 
galaxies of the same circular velocity at lower redshift. Therefore, 
the discrepancies discussed above call for a deep re-thinking of the 
feedback schemes currently implemented in SAMs. Alternatively, 
the problem may be related to the nature of DM; if this is not com- 
pletely collisionless, the density profiles of small D M haloes may 
be sign ificantly different from the generally assumed lNavarro et al.l 
([19960 form, and this would influence cooling rates and infall times, 
galaxy sizes and SFRs. 

All model predictions discussed in this paper and the data 
shown in Fig. 2 are available in electronic format upon request. 
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